Mechanical properties of the domains of titin in a Go-like model 
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Comparison of properties of three domains of titin, II, 127 and 128, in a simple geometry-based 
model shows that despite a high structural homology between their native states different domains 
show similar but distinguishable mechanical properties. Folding properties of the separate domains 
are predicted to be diversified which reflects sensitivity of the kinetics to the details of native struc- 
tures. The Go-like model corresponding to the experimentally resolved native structure of the II 
domain is found to provide the biggest thermodynamic and mechanical stability compared to the 
other domains studied here. We analyze elastic, thermodynamic and kinetic properties of several 
structures corresponding to the 128 domain as obtained through homology-based modeling. We 
discuss the ability of the models of the 128 domain to reproduce experimental results qualitatively. 
A strengthening of contacts that involve hydrophobic amino acids does not affect theoretical com- 
parisons of the domains. Tandem linkages of up to five identical or different domains unravel in a 
serial fashion at low temperatures. We study the nature of the intermediate state that arises in the 
early stages of the serial unraveling and find it to qualitatively agree with the results of Marszalek 
et al. 



I. INTRODUCTION 



The way proteins are able to sustain mechanical stress 
has recently attracted large interest and promoted the 
development of new tools, both experimental and the- 
oretical, to study mechanical unfoldingiSi 3 . One model 
system largely used for such studies is titin, a giant mod- 
ular protein specific for vertebrate muscle. A single-chain 
titin molecule forms a filament up to \-1\xm long which 
connects the edge of the sarcomere, the basic unit of the 
muscle fibrils, with its middle (for reviews see Ref. 4,5,6,7 
). These connections provide both a molecular ruler 
that determines the exact length of the sarcomere and 
a template for interactions with other proteins involved 
in muscle ultrastructure and regulation^ One of the most 
important functions of titin is to act as a spring which 
confers passive elasticity on sarcomers&2ii2iiiii£. 

Two sequence motifs are present in the I-band, the 
elastic region of titin: a long stretch rich of prolines, 
glutamic acids, valines and lysines (PEVK motif) and 
up to (depending on the isoform) 100 copies of tandem 
bead-like globular domains whose fold belongs to the im- 
munoglobulin (Ig) superfamily. 13 Elasticity is thought 
to result from the interplay of these two elements act- 
ing as molecular springs placed in series, in which most 
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of the tension is provided by the largely unstructured 
PEVK motif, whereas the Ig domains could provide an 
additional contribution through a reversible unfolding 
mechanism li 4 * 1 ^ Most of the recent studies of titin elas- 
ticity have concentrated on the Ig element taking the 
27th Ig module of cardiac titin I-band (127) as a repre- 
sentative model system. Its three-dimensional structure 
has been determined - it is an 8 strand (3 sandwich with 
two anti-parallel /3-sheets packed against each other and 
held together by a tight hydrophobic core^ The N- and 
C-termini point to opposite directions, thus making the 
motif particularly suitable to a sequential assembly in a 
filament. Thermodynamically, 127 is highly stable both 
against the thermal and the chemical unfolding>iLi^ The 
folding pathways both of the isolated 127 and of a ho- 
mopolymer constructed by tandem 127 repeats have been 
characterised in detail experimentallyi2*i2i22i2I Molecu- 
lar dynamics (MD) studies and other simulations have 
provided further details into the mechanism of unfolding 
suggesting that detachment of the A-strand is the earliest 
step in forced unfoldingi 22 i 23 i 24 i 25 i 26 

The folding, elastic, and thermodynamic properties of 
of 127 have also been studied within the Go modeli2£i22i2£ 
Go models 30-31 are constructed based on the knowledge of 
the native structure and are coarse-grained. In their sim- 
plest version, the protein is represented by the locations 
of the C Q atoms. This modelling is geometry-based and 
is implemented by choosing effective couplings between 
the C a s in a way that the ground state of the system 
coincides with the native conformation of the protein. 
This approach is less realistic than all-atom simulations 
but it offers many advantages. It allows one for stud- 



2 



ies of: a) folding and stretching within the same model, 
b) tandem arrangements of many domains, c) ranges of 
control parameters such as temperature, T, d) more re- 
alistic pulling speeds, v p , and e) differences and similar- 
ities between various proteins within one framework. It 
also highlights the link between native structure and the 
properties of a protein. However, it is expected that the 
further away from the native structure a conformation is, 
the more approximate the description becomes. 

The focus of this paper is to consider two other Ig do- 
mains from the elastic region of titin, II and 128, and 
to compare in silico their mechanical, kinetic, and ther- 
modynamic properties to those of 127 within the same 
theoretical framework. The three domains correspond to 
distinct sequences and their level of identity is low - it 
ranges between 30% and 40%M- 

The structure of 128 is available from homology based 
modelling, 3 - 2 " whereas the native structure of II has been 
determined experimentally by x-ray crystallography. 33 
The thermodynamic stabilities of the domains have also 
been measured and show a remarkable difference between 
II and 127 (which are very stable) and 128 (the most 
unstable) >ii The refolding kinetics of 128 also has been 
found to be about three orders of magnitude slower than 
that of 127. 3 Despite its low thermodynamic stability, the 
experiments show that 128 is mechanically more stable 
than I27. 3 Hcrc, we produce a number of homology-based 
models of 128 and compare their behavior. However, in 
most of the paper, we focus on the model derived in^ 
We show that none of the structures generates a full qual- 
itative agreement with the experimental findings, at least 
when analyzed within the dynamical framework provided 
by a simple Go-like model. 



II. SOURCE OF THE STRUCTURES USED 

The experimental structures of II and 127 have been 
deposited in the Protein Data Bank 34 We shall refer to 
them hereon as Il-lglc and 127-ltit. The II domain 
consists of 98 residues whereas 127 of 89 residues. The 
other two structures have been determined by homology 
modelling 32 and will be denoted as Il-model and 128- 
model. 128-model was obtained using the MODELLER 
program.— The Il-model is the structure of a mutated 
sequence of Il-lglc at a single position Gly-71-Ala. This 
mutation occurs in the well exposed loop region, denoted 
by H in the lower left panel of Figure 1, and is not ex- 
pected to affect the thermodynamic or mechanical sta- 
bility of the domain. It is probably an inconsequential 
mutation that occurs spontaneously in the system used 
in crystallographic studies. The root mean square devia- 
tion between this structure and the experimental Il-lglc 
is about 1.16 A. Ribbon representations of the native 
structures of the domains studied here are shown in Fig- 
ure 1. 

At the end of the paper, we shall discuss other 
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FIG. 1: The structures of the domains of titin studied in this 
paper. The labelling of the /3-strands (symbols A through G) 
is indicated, together with the allocation of the amino acids 
to the structures. The fragments corresponding to the a-helix 
are denoted by H. 

homology-based models of the 128 domain. These 
are denoted by 128- A through I28-E. 128- A and I28-B 
were obtained by the PSQPIR routine in the WHATIF 
program. 3 - The templates used here were ltit.pdb and 
ltiu.pdb respectively - the former represents the average 
structure of the NMR bundle and the latter is the first 
structure of the NMR bundle (this structure is the best 
in terms of the internal energy). I28-C and I28-D used 
the same templates but were produced by the automatic 
Swissmodel webserver. 37 Finally, I28-E was produced by 
the Swissmodel webserver using the ltlk.pdb template. 
The alignment, produced by clustalx^ was in any case 
straightforward since the sequences of 127 and 128 have 
the same length and do not require insertions/deletions. 
The sequence of telokin (ltlk) can also be structurally 
aligned to 127 producing a unique alignment. 



III. MODEL AND METHOD 

We perform molecular dynamics simulations of a con- 
tinuum space Go-like model. The coarse-grained charac- 
ter of our theoretical description was motivated by the 
desire to deal with models that allow for studies of fold- 
ing. The details of our approach are described in RefiiSi 4 ^ 
with refinements as presented in Ref4i. Each amino acid 
is represented by a point particle of mass m located at the 
position of the C a atom. The interactions between amino 
acids are divided into native and non- native contacts. 
We follow a procedure given in Ref42i and determine the 
native contacts by considering the all-atom native struc- 
ture and by identifying those pairs of amino acids whose 
atoms effectively overlap. In the criterion of the effec- 
tive overlap, the atoms are represented by spheres with 
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FIG. 2: Probability of staying in the native structure for the 
domains indicated. The lines correspond to the domains left- 
to-right as listed in the order at the top of the figure. The 
corresponding values of Tj are listed below in the same order. 



FIG. 4: The scenario of folding events for 127-ltit. The data 
points show average times (the average is over 200 trajecto- 
ries) to establish specific contacts corresponding to the se- 
quential distance of \j — i\ at the temperature of the optimal 
folding. The letter symbols indicate the nature of the strands 
that form the contacts. 




FIG. 3: Median folding times for the four systems as a func- 
tion of temperature. The arrows indicate values of the folding 
temperature Tf. All of these systems are good folders in the 
sense that the folding temperatures are in the region of good, 
if not necessarily optimal, folding. The error bars are of the 
order of the data points (the convention kept throughout the 
paper). 



radii that are a factor of 1.24 larger than the atomic 
van der Waals radii^ to account for the softness of the 
potential. The native contacts are then represented by 
the Lennard- Jones potentials 4e[(tTy/r i:( -) 12 — (cy/r^-) 6 ], 
where is the distance between C Q atoms i and j. The 
length parameters <7jj are determined so that the min- 
imum of the pair potential coincides with the distance 
between C a atoms in the native structure. In order to 
prevent entanglements, the remaining pair-wise interac- 
tions, i.e. the non-native contacts, correspond to a pure 
repulsion. This is accomplished by taking the Lennard- 
Jones potential with cry = a = 5 A and truncating it at 
2 1/6 cr. 

All contacts have the same energy scale e. This en- 
ergy scale corresponds to between 800 and 2300 K as it 
represents effectively hydrogen bond and hydrophobic in- 
teractions so the room temperature should correspond to 
T = ksT/e of about 0.1 - 0.3 (&b is the Boltzmann con- 
stant). The specificity corrections could be implemented 
if known reliably. Replacing the Lcnnard-Jones potential 
in the contacts by a 10-12 interaction yields equivalent 
results both in folding^ and in stretching^ An improved 
simulation might involve enhancing the strength of con- 
tacts that correspond to a disulfide bridge that is present 
in the II domain. 

Neighboring C Q atoms are tethered by a harmonic po- 
tential with a minimum at 3. 8 A and the force constant 
of lOOeA -2 . A four-body term that favors the native 
sense of the local chirality and thus facilitates forma- 
tion of local structure in a proper way is also kept in 
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the Hamiltonian,- 1 This term vanishes for the native-like 
chirality and introduces an energy penalty of order e for 
the opposite chirality^. 

A Langevin thermostat with damping constant 7 is 
coupled to each C Q to control the temperature. For the 
results presented below 7 = 2m/r, where r = yjma 2 Je ~ 
3ps is the characteristic time for the Lennard- Jones po- 
tential. This produces the overdamped dynamics appro- 
priate for proteins in a solvent^ but is roughly 25 times 
smaller than the realistic damping from water^ Previ- 
ous studies show that our choice speeds the kinetics up 
without altering behavior, and tests with larger 7 con- 
firm a linear scaling of folding times with 7^Li°i Thus 
the folding times reported below should be multiplied by 
25 for comparison to experiment. 

The stretching protocol follows the reference^ and is 
implemented parallel to the initial end-to-end vector of 
the protein and both ends of the protein are attached 
to harmonic springs of spring constant k. We consider 
the "soft" spring case of k = 0.12e/A 2 which corre- 
sponds to typical elastic constants of AFM cantilevers. 
The outer end of one spring is held fixed at the origin, 
and the outer end of the other is pulled at constant speed 
v p which results in a displacement d away from the lo- 
cation at the initial time. We focus on v v = 0.005A/t 
which corresponds to a velocity of about 7 x 10 6 nm/s 
when 7 = 2m/ t. Experimental AFM velocities range 
from 0.3 to 10 000 nm/s^i^S^ whereas all atom simula- 
tions correspond to speeds which are at least six orders 
of magnitude highersS This large speed used in the all- 
atom simulations is believed to be one of the reasons 
for the peak forces that are a factor of 10 bigger than 
found experimentally (another, and probably more im- 
portant, could be working against the surface tension of 
the droplet of water that surrounds the model 127 domain 
of titin). 

The folding and stretching processes are characterized 
by the order in which native contacts are formed and 
broken respectively. The complication is that, at a finite 
T, a pair distance may fluctuate around a selected cut- 
off value. Thus, when discussing folding, we determine 
the average time t c for each contact to form for the first 
time. On the other hand, when discussing stretching, 
we determine the average displacement, d u , at which a 
contact holds for the last time. The presence of a contact 
between amino acids i and j is declared when nj does 
not exceed 1.5ct„. 



IV. RESULTS AND DISCUSSION 

Folding of single domains of titin 

Figure 2 shows the equilibrium probability, Pq, of stay- 
ing around the native state as a function of T. The crite- 
rion for this is that each pair that forms a native contact 
does not exceed the cut-off distance. Po depends on T in 
a sigmoidal fashion and the folding temperature, X 1 /, cor- 
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responds to Po crossing 0.5. The values of X 1 / do not vary 
much: they range between 0.19 and 0.23. The smallest 
value is for 127-ltit and the largest for Il-lglc, suggest- 
ing that domain II, at least when in isolation, should be 
more stable than 127. The value of X/ for Il-model is 
0.21 - it is close to Il-lglc but clearly the two values 
are not identical. 128-model appears to be more thermo- 
dynamically stable than 127 which is at variance to the 
experimental findings. 17 

In order to characterize the folding kinetics we start the 
system in an unfolded conformation and determine the 
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FIG. 7: Force-displacement curves for the four structures at 
T = 0. 
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FIG. 8: Same as in Fig. 4 but for T—0.3 which should corre- 
spond to the room temperature situation. The pulling force is 
averaged over lOOr, i.e. over the distance of 0.5 A, to reduce 
the random noise. 
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FIG. 9: The scenario of mechanical unfolding at T=0 for 
127-ltit. The data points show the last distance at which 
particular contacts are considered to be still holding. The 
letter symbols are as in Figure 4. 



side of this region and on the low temperature side. The 
optimal values of i/ ;<i indicate that the structure 127- 
ltit is the hardest to fold to. Despite the difference in 
the T dependence between Il-model and Il-lglc the two 
systems have nearly the same optimal folding times. 

Figures 4 through 6 show that the four systems 
(results for Il-model not shown) also differ in their 
folding scenarios as represented by plots of t c vs. the 
contact order, \j — i\, at the temperature of the fastest 
folding. Il-model shows the most monotonic dependence 
of t c on \j — i\. However, its counterpart, Il-lglc, 
concludes its folding by establishing mid-range contacts. 
The same happens for the remaining structures but the 
nature of the last contacts is specific to the structure: 
it is the C strand joining F in the case of 127-ltit and 
the B strand joining E in the case of 128-model. Each 
scenario involves nearly parallel branches of events 
taking place at various times for the same contact order. 
The four systems studied here are clearly not equivalent 
kinetically and, in particular, Il-model is not equivalent 
to Il-lglc. 

Stretching of single domains of titin 



median "first passage time", tf id, i.e. the first time to 
establish all native contacts. The T dependence oitfdd is 
shown in Figure 3. The two homology-determined struc- 
tures have a broad region of fast folding and Tf is within 
this region. On the other hand, the experimentally de- 
termined structures correspond to a significantly reduced 
width of the region of best folding with Tf being just out- 



The dependence of the pulling force, F, on the dis- 
placement for the four structures is shown in Figures 7 
and 8 for T=0 and 0.3 respectively. All of the four struc- 
tures have quite similar F — d patterns. The maximum 
force peak is large and in all cases it comes early during 
the unfolding providing the main resistance of the struc- 
tures to the pulling device. The maximum force peak is 
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FIG. 10: Same as in Figure 9 but for 128-model. 



FIG. 12: Same as in Fig. 9 but for T=0.3. The data points 
are averaged over 20 trajectories. 
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FIG. 11: Same as in Figure 9 but for Il-lglc. 



the highest for Il-lglc at both temperatures. This in- 
dicates that when standing alone II is the most stable 
domain to sustain pulling force. The heights of the max- 
imum forces of other three structures are very close to 
each other. At both temperatures, Il-model appears to 
be mechanically less stable compared to Il-lglc. 

The second maximum in the patterns shows more vari- 
ations. The details of its shape varies between the do- 
mains and its height is noticeably bigger for the two ver- 
sions of the II domain. Unlike the first maximum, the 
second maximum builds up at a location which differs 



somewhat from structure to structure. The third maxi- 
mum is practically absent in the case of 127-ltit but it ex- 
ists, in differing shapes, in the remaining three structures. 
The differences in the shapes and positions of the max- 
ima are related to different sets of contacts that are in- 
volved. We note that, mechanically, Il-model appears to 
be nearly indistinguishable from the experimental struc- 
ture Il-lglc. Overall, the differences in the F — d curves 
between the studied domains are subtle so the domains 
should be functionally equivalent in stretching. 

The differences between the structures diminish as the 
temperature is raised and at a sufficiently high tempera- 
ture, of order 0.8 in this case, the F — d curves switch to 
a worm-like chain featureless behavior— in which F just 
grows with d monotonically combined with small thermal 
fluctuations. 28 ! 29 

The scenarios of the contact rupture at T—0 and 0.3 
are shown in Figures 9 through 11 and 12 through 14 
respectively. Sets of data points that form nearly hori- 
zontal lines correspond to the maxima in the F—d curves. 
The nature of the first maximum is very similar for all 
of the structures studied and it involves rupturing of the 
A'-G contacts. The first to rupture are the strictly ter- 
minal A-G bonds but this process generates no big force. 
There are also varying contributions from the A'-B, A-B, 
B-G and F-G contacts. The second maximum involves 
separating strand C from F and a varying number of 
contacts between strand B and E. The third maximum 
(not present for 127-ltit) involves breaking of contacts 
between strands D and E. 

Marszalek et al— have identified a folding interme- 
diate in 127 - a hump on the F — d curve that pre- 
cedes the maximum force - as being due to the rupture 
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of two hydrogen bonds in A-B. This identification re- 
lies on the AFM technique combined with the steered 
molecular molecular dynamics simulations and on mak- 
ing an amino-acidic substitution on the sixth position. 
Our T=0 results suggest that most of the A-B bonds 
(the filled triangles in Figures 9 and 12; in Figure 9 these 
symbols overlap with other and, except for one, are hard 
to see) break simultaneously with those of A'-G and one 
(between 4 and 23) precedes the rupture of A'-G. At 
T = 0.3, on the other hand, the bonds of A'-G and A-B 
rupture almost simultaneously. It should be noted that 
the coarse grained nature of the Go-like model does not 
allow to identify breaking of hydrogen bonds. The hydro- 
gen bonds between the A and B strands arise probably 
at 5-24 and 6-24. Since we use relatively large cut-off 
distances for contact breaking, these contacts are still 
present when the 4-23 bond is ruptured. However, the 
relative displacement of strand A with respect to strand 
B should deform the hydrogen bonds. We conclude that 
our advance rupture of the 4-23 bond observed in the low 
temperature data can be related to the hydrogen bond 
rupture discussed in ref^i. Thus our results arc in a 
qualitative agreement with the previous finding. 

As the temperature is raised from T=0 to 0.3, the 
contact rupturing scenarios simplify somewhat: the 
events the contacts are more clearly grouped into several 
horizontal lines corresponding to the maxima in the 
force. For instance, the breaking of the A-G contacts at 
T=0.3 takes place almost at the same time as that of the 
A'-G bonds. The heights of the force peaks, however, 
decrease due to thermal fluctuations. On increasing the 
temperature further, the stretching scenarios become 
even more simplified. At T=0.8, i.e. in the entropic 
limit, they become strictly monotonic as a function of 
the contact order, as illustrated in Figure 15. In this 
limit, the four systems are strictly indistinguishable 
from the mechanic point of view. 

Stretching of several domains of titin 

We now generate tandem arrangements of five identi- 
cal domains of titin. The T=0 F — d patterns are shown 
in Figure 16. To a good approximation, the patterns 
are a serial combination of the single domain patterns of 
Figure 7: the proteins unwind domain by domain. The 
reason for this serial unwinding is that for a single domain 
the largest force peak is located early in the unraveling 
process. However, the larger the temperature, the more 
parallelism in the unravelling 28 . The small shoulder be- 
fore the first peak points out that all domains unfold 
simultaneously to a metastable state in which contacts 
between all terminal strands A and G and one contact 
(4-23) between all strands A and B break before the first 
domain unfolds completely. As we discussed above, this 
metastable state for 127 domain has been also identified 
as the intermediate state2I. Note that in RefiSi the au- 
thors did not consider the breaking of the A-G contacts 
as being an ingredient of the intermediate state. We have 
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FIG. 13: The scenario of stretching for 128-model at T = 0.3. 
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FIG. 14: The scenario of stretching for Il-lglc at T = 0.3. 



found that a similar phenomenon takes place for the two 
versions of the II domain and for 128-model. In all cases, 
the unfolding intermediate involves breaking of contacts 
between strands A and G and some contacts between 
strands A and B. We observe that the extension at which 
this happens is smaller in II and 128 than in 127. Fig- 
ure 16 also shows that the 'hump' before the first peak is 
somewhat milder in the other domains than in 127. Note 
that in RefiSi the 'hump' is also observed for a tandem 
arrangement of 128. 

On increasing the temperature, the simultaneous un- 
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FIG. 15: The scenario of stretching for 127-ltit at T=0.8. 




FIG. 16: Stretching of five domains of titin, linked in a tandem 
arrangement as listed in each panel, at T=0.0. 



tures. The reason for this is that the second maximum 
in 127-ltit and the third maximum in other structures 
are barely stable. As unfolding of the domains contin- 
ues, thermal fluctuations (or kinetic energy) associated 
with the already released chain length gain in strength 
and destroy the peaks with low stability. Figure 17 shows 
that, for tandem arrangements of 128-model (the second 
panel) and Il-lglc (the third panel), the second peak 
becomes increasingly weak. It actually disappears in the 
last segment of the sawtooth pattern for the case of 128- 
model. 

It can also be seen in the top three panels of Figure 
17 that the heights of the maxima in the sawtooth pat- 
terns are the highest for the Il-lglc domains. Further- 
more, the first four maxima for the 128-model domains 
are a bit weaker than those of 127-ltit even though they 
were of about the same height for the single domains. 
This indicates that the admixture of parallel unwinding 
that occurs in tandem arrangements affect mechanical 
stability of the individual domains. In order to study 
the stability differences of the domains better we have 
constructed heterogeneous tandem arrangements of the 
structures. In this case the less stable domains will un- 
fold before the more stable ones. The two bottom panels 
in Figure 17 shows the F — d curves for two heteroge- 
neous tandem arrangements of five domains at T=0.3. 
Specifically, 3 domains of 127-ltit are linked, in an al- 
ternating fashion, with two domains of 128-model or two 
domains of Il-lglc. We find that Il-lglc is the domain 
that unravels the last, i.e. is the most stable. 128-model, 
on the other hand, shows a more complex behavior. We 
find that when placed in tandem with 127-ltit, 128-model 
is the domain that unravels the first, but after the un- 
raveling of the first domain, the two types of domain can 
alternate - the order of unfolding is determined by fluc- 
tuations. This indicates that, in tandem arrangements, 
128-model is only slightly less stable than 127 and the 
difference in mechanical stability between these two do- 
mains becomes insignificant comparing to thermal fluc- 
tuations after the first domain unfolds. This observation 
is consistent with the experimental findings^ As an ex- 
ample, the curve shown in the last panel of Figure 17, 
corresponds to unraveling proceeding in the order: 128, 
127, 127, 127, and 128. Figure 18 shows the snapshots 
of the five-domain tandem arrangements corresponding 
to d=500 A and T=0.3 and illustrates the domain-by- 
domain character of the unfolding in each case. 



winding of the domains to the intermediate state still 
holds but also thermal fluctuations gain in importance 
and affect the patterns significantly. Figure 17 shows the 
F-d patterns for various tandems of domains at T of 0.3. 
It is seen that it is only the first segment that repeats 
the single domain pattern fairly accurately whereas the 
remaining segments in the sawtooth-like pattern lose the 
last single domain maximum: there is no second maxi- 
mum in the fivefold repeat of 127-ltit and no third maxi- 
mum in the remaining fivefold repeats of the other struc- 



V. SUMMARY AND CONCLUSIONS 

In this paper, we have presented predictions of Go-like 
modeling for four native structures of titin domains and 
of their tandem arrangements and demonstrated that, 
despite noticeable differences in their native structures, 
the force-displacement plots and the unravelling events 
are qualitatively similar. Thus, the domains should be in- 
terchangeable in terms of their elastic properties. There 
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J . ltit 128 ltit 128 ltit T 



500 d [A] 1000 



FIG. 17: Stretching of five domains of titin, linked in a tandem 
arrangement as listed in each panel, at T=0.3. 




100 200 300 

d [A] 



100 200 300 

d [A] 



FIG. 19: An analog of Figure 6 at T=0.3 for a general- 
ized model in which the hydrophobic-hydrophobic contacts 
are made stronger by a factor of two. 



d=500 A 



T = 0.3 



5 x Hit 



5 x 128 



5 x 1g1c 



1tit1g1c1tit1g1c1tit 



Hit I28 1titl281tit 



FIG. 18: Snapshots of the five-domain arrangements when 
stretched by 500 A. The top-to-bottom ordering of the snap- 
shots corresponds to the one adopted in Figure 11. 



are also, however, substantial differences when one con- 
siders finer details of the elastic stability of the domains 
and their thermodynamic and kinetic properties when 
the domains are considered in isolation. Il-lglc is the 
most stable structure in terms of its elastic resistance 
both separately and when placed in tandem arrange- 
ments. The 128-model domain has a mechanical stability 
close to that of as single domain of 127, but it is clearly 
less stable in tandem arrangements that involve 127. 

More noticeable distinctions are observed for the ther- 
modynamic and kinetics properties of single domains. 
We have shown that Il-lglc and 128-model are the two 
most thermodynamically stable structures and 127- ltit is 
the least stable one. 128-model has a faster folding and 
the range of temperature in which folding is optimal is 
wider than 127; the same is observed for Il-model when 
compared to Il-lglc. The kinetics of refolding events as 
studied as a function of the contact order also show clear 
distinctions. The contacts that form the last, on aver- 
age, in the folding of 127-ltit are between strands C and 
F while for 128-model they are between strands B and 
E. In Il-lglc and Il-model, the last events involve still 
other sets of contacts. Only the earliest folding events 
are similar in all models. All these differences show how 
sensitive the folding properties of the models are to the 
precise details of the native structures. 

Most of our predictions about equivalence, or lack of 
it, of the domains of titin in the context of the mechan- 
ical and kinetic folding properties remains to be tested. 
Currently, the 127 domain has been studied experimen- 
tally in the most exhaustive way. The Go-like theoret- 
ical account of mechanical properties of 127 has turned 



10 



TABLE I: Summary of the properties of the theoretically determined structures corresponding to the 128 domain of titin as 
obtained within the Go model used in this paper. The first column shows a comparison to the similarly derived properties of 
127-ltit. The lower the value of Tf, the lower the thermodynamic stability. The values of the Z-score listed in the table are for 
all contacts. They represent the deviations from the average quality value as determined by using the program WHATIF with 
the NEWQUA option in the QUALITY menu. 





127-ltit 


128-model 


I28-A 


I28-B 


I28-C 


I28-D 


I28-E 


F max (T = 0) [e/A] 


3.84 


3.92 


3.50 


3.66 


4.08 


3.42 


3.48 


Tf 


0.185 


0.224 


0.174 


0.172 


0.192 


0.178 


0.214 




0.275 


0.30 


0.25 


0.275 


0.25 


0.275 


0.275 


tfold/T (T = Train) 


3700 


1730 


2200 


2040 


1800 


2480 


1970 


tfoid/r (T = 0.3) 


4320 


1730 


5440 


2680 


2600 


2540 


2060 


F?nax (.T 0) 


4.92 


5.06 


4.86 


4.30 


4.89 


4.32 


4.72 


Z-score 


-4.85 


-3.45 


-6.17 


-5.91 


-6.10 


-6.23 


-3.58 



"model with enhanced hydrophobic contacts 



out to be consistent with the experimental findingSi2L2S 
Some experimental results available for the 128 domain, 
however, appear to be at odds with our simulational re- 
sults, especially when one compares 128 to 127. First 
of all, our studies predict the stability of 128-model to 
be higher than that of 127-ltit whereas the experiment 
shows the opposite: the melting points for 127 and 128 
are 72°C and 35°C respectively^ Secondly, our folding 
simulations indicate that 128-model folds faster than 127 
whereas experiment shows that the folding rate of an 
isolated 128 domain is 0.025 s _1 , which is three orders 
of magnitude lower than that of 127 (32 s" 1 ) 3 Finally, 
the atomic force microscopy studies^ have shown that 
128 domains are mechanically more stable than 127 both 
in homo- and hetero-domain polyproteins whereas our 
studies point to nearly equivalent stabilities. This sug- 
gests that the method we are using cannot easily describe 
cases such as 128 whose structure is very unstable in so- 
lution. Any 'rigid' model which assumes a stable and 
compact structure would therefore not be appropriate to 
represent the experimental conditions. 

One source of the discrepancy could be that our sim- 
ple model incorporates just one uniform energy scale e 
whereas a more realistic modeling would involve hetero- 
geneous couplings. This expected lack of homogeneity 
might govern the subtle differences between the domains. 
In order to probe such effects, we have considered a gen- 
eralized Go-like model in which the interactions in con- 
tacts that link two hydrophobic amino acids (ILE, LEU, 
MET, VAL, PHE, TRP, and TYR) are enhanced by the 
factor of two. Figure 19 for T = 0.3 shows that, me- 
chanically, the relationships between the four domains in 
this two energy scale model are very much like as in the 
basic model. The noticeable difference though is that 
127-ltit acquires the more pronounced third peak in the 
force. (The models with the two-fold enhancement of 
the hydrophobic-hydrophobic contact strengths are inad- 
equate kinetically: they give rise to an easy misfolding.) 

Another source of the error then may be the struc- 
ture itself. The homology-based derivation of the struc- 



ture may not sufficiently refined for applications that in- 
volve dynamics. Some hints can already be inferred by 
considering the II domain. Our studies show that II- 
model and Il-lglc, both meant to represent the same 
system, have distinct kinetics and very different stabil- 
ities: the model structure has 259 contacts instead of 
288 and is less stable thermodynamically and mechan- 
ically than the experimentally derived structure. Even 
the force-displacement curves have distinct details indi- 
cating that a precise knowledge of the structure may af- 
fect prediction of dynamical properties in a substantial 
way. 

We have thus generated five more homology-based 
structures of the 128 domains, denoted as 128- A through 
I28-E. 128-model and I28-E are of the highest quality, as 
judged from the standard structure quality checks^ The 
summary of their elastic, thermodynamic, and kinetic 
properties, as determined within our Go-model, is given 
in Table I and compared to those of 127-ltit. There are 
three conditions that should be met for a model to agree 
with the experimental results: 1) 128 should be more sta- 
ble mechanically than 127, but 2) it should be less stable 
thermodynamically, and finally 3) it should refold sig- 
nificantly slower. An inspection of Table I indicates that 
none of the six 128 structures satisfies all three conditions. 
At best, two conditions are met in some structures. For 
instance, I28-C is the strongest mechanically of all of the 
128 structures and it is also stronger than 127-ltit. Its 
folding temperature, though not lower than that of 127- 
ltit, is nearly to it. However, I28-C folds faster both at 
T m im when folding proceeds the fastest, and at T=0.3 
which appears to correspond to the room temperature. 
Another good choice could be I28-A. This structure yields 
slower folding than 127-ltit at T=0.3 and is less stable 
thermodynamically. However, its peak force is substan- 
tially smaller than that for 127-ltit. Incompatibly with 
our predictions, the Z-scores of both I28-C and I28-A are 
low which means that these structures are not very good 
in terms of packing quality. The best Z-score among the 
new structures has I28-E, however this structure yields 
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a mechanical stability which is significantly lower than 
that of 127-ltit. 

This analysis indicates that the precise definition of 
the structure has a substantial impact on the predicted 
properties. Existence of a well defined and fairly rigid 
native structure is at the heart of applicability of the 
Go-like modeling. In the context of the 128 domain, 
however, this feature may also be a root of the prob- 
lem given that the experimentally determined melting 
temperature of 128 is only 35° C. Nevertheless our analy- 
sis illustrates a possibility that using a dynamical model 
may augment homology-based determination of protein 
structures. The final message is that the simple Go-like 



description can capture existence of differences in proper- 
ties between various domains of titin and of their various 
models. 
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